Speaker independent acoustic modeling using speaker normalization

نویسندگان

Jun Ishii

T. Fukuda

چکیده

This paper proposes a novel speaker-independent (SI) modeling for spontaneous speech data from multiple speakers. The SI acoustic model parameters are estimated by individual training for inter-speaker variability and for intraspeaker phonetically related variation in order to obtain a more accurate acoustic model. The linear transformation technique is used for the speaker normalization to extract intra-speaker phonetically related variation and also is used for the re-estimation of inter-speaker variability. The proposed modeling is evaluated for a Japanese spontaneous speech data, using continuous density mixture Gaussian HMMs. Experimental results from the use of proposed acoustic model show that the reductions in word error rate can be achieved over the standard SI model regardless the type of acoustic model used.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker normalized acoustic modeling based on 3-D Viterbi decoding

This paper describes a novel method for speaker normalization based on a frequency warping approach to reduce variations due to speaker-induced factors such as the vocal tract length. In our approach, a speaker normalized acoustic model is trained using time-varying (i.e., state, phoneme or word dependent) warping factors, while in the conventional approaches, the frequency warping factor is xe...

متن کامل

Vocal Tract Length Normalization for Speaker Independent Acoustic-to-Articulatory Speech Inversion

Speech inversion is a well-known ill-posed problem and addition of speaker differences typically makes it even harder. This paper investigates a vocal tract length normalization (VTLN) technique to transform the acoustic space of different speakers to a target speaker space such that speaker specific details are minimized. The speaker normalized features are then used to train a feed-forward ne...

متن کامل

Speaker and gender normalization for continuous-density hidden Markov models

In this paper we describe a speaker-cluster normalization algorithm that we applied to both gendernormalization and speaker-normalization. To achieve parameter sharing the acoustic space is partitioned into classes. A maximum likelihood approach has been proposed under which the delta between the distribution mean and its corresponding acoustic class is mostly speaker-independent, whereas the m...

متن کامل

Speaker normalization through formant-based warping of the frequency scale

Speaker-dependent automatic speech recognition systems are known to outperform speaker-independent systems when enough training data are available to model acoustical variability among speakers. Speaker normalization techniques modify the spectral representation of incoming speech waveforms in an attempt to reduce variability between speakers. Recent successful speaker normalization algorithms ...

متن کامل

Text-independent speaker verification using virtual speaker based cohort normalization

In this paper, we propose a new score normalization method for text-independent speaker verification using GMM (Gaussian Mixture Model). In the proposed method, cohort model is designed as virtual speaker model based on the similarity of local acoustic information between the reference speaker and other customers. The similarity is determined using statistical distance between model components ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1998

Speaker independent acoustic modeling using speaker normalization

نویسندگان

چکیده

منابع مشابه

Speaker normalized acoustic modeling based on 3-D Viterbi decoding

Vocal Tract Length Normalization for Speaker Independent Acoustic-to-Articulatory Speech Inversion

Speaker and gender normalization for continuous-density hidden Markov models

Speaker normalization through formant-based warping of the frequency scale

Text-independent speaker verification using virtual speaker based cohort normalization

عنوان ژورنال:

اشتراک گذاری